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Abstract 

The vertex cover problem is one of the most important and intensively studied combina¬ 
torial optimization problems. Khot and Regev [32, 33] proved that the problem is NP-hard to 
approximate within a factor 2 — e, assuming the Unique Games Conjecture (UGC). This is tight 
because the problem has an easy 2-approximation algorithm. Without resorting to the UGC, 
the best inapproximability result for the problem is due to Dinur and Safra [17, 18]: vertex 
cover is NP-hard to approximate within a factor 1.3606. 

We prove the following unconditional result about linear programming (LP) relaxations of 
the problem: every LP relaxation that approximates vertex cover within a factor 2 — e has super- 
polynomially many inequalities. As a direct consequence of our methods, we also establish 
that LP relaxations (as well as SDP relaxations) that approximate the independent set problem 
within any constant factor have super-polynomial size. 
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1 Introduction 


In this paper we prove tight inapproximability results for vertex cover with respect to linear 
programming relaxations of polynomial size, vertex cover is the following classic problem: given 
a graph G = (V, E) together with vertex costs c v > 0, v e V, find a minimum cost set of vertices 
U c V such that every edge has at least one endpoint in U. Such a set of vertices meeting every 
edge is called a vertex cover. 

It is well known that the LP relaxation 

min ^ c v x v 

veV 

s.t. x u + x v > 1 
0 < x v < 1 


iuv <E E 
VueV 


( 1 . 1 ) 


approximates vertex cover within a factor 2. (See e.g., Hochbaum [27] and the references therein.) 
This means that for every cost vector there exists a vertex cover whose cost is at most 2 times the 
optimum value of the LP. In fact, the (global) integrality gap of this LP relaxation, the worst-case 
ratio over all graphs and all cost vectors between the minimum cost of an integer solution and the 
minimum cost of a fractional solution, equals 2. 

One way to make the LP relaxation (1.1) stronger is by adding valid inequalities. Here, a valid 
inequality is a linear inequality Y, v eV a v x v ^ j6 that is satisfied by every integral solution. Adding all 
possible valid inequalities to (1.1) would clearly decrease the integrality gap all the way from 2 to 
1, and thus provide a perfect LP formulation. However, this would also yield an LP that we would 
not be able to write down or solve efficiently. Hence, it is necessary to restrict to more tangible 
families of valid inequalities. 

For instance, if C c V is the vertex set of an odd cycle in G, then JL,ec x v > is a valid 
inequality for vertex covers, known as an odd cycle inequality. However, the integrality gap remains 
2 after adding all such inequalities to (1.1). More classes of inequalities are known beyond the 
odd cycle inequalities. However, we do not know any such class of valid inequalities that would 
decrease the integrality gap strictly below 2. 

There has also been a lot of success ruling out concrete polynomial-size linear programming 
formulations arising from, e.g., the addition of a polynomial number of inequalities with sparse 
support or those arising from hierarchies, where new valid inequalities are generated in a system¬ 
atic way. For instance, what about adding all valid inequalities supported on at most o(n) vertices 
(where n denotes the number of vertices of G), or all those obtained by performing a few rounds of 
the Lovasz-Schrijver (LS) lift-and-project procedure [39]? In their influential paper Arora, Bollobas 
and Lovasz [2] (the journal version [3] is joint work with Tourlakis) proved that none of these broad 
classes of valid inequalities are sufficient to decrease the integrality gap to 2 - e for any e > 0. 

The paper of Arora et al. was followed by many papers deriving stronger and stronger tradeoffs 
between number of rounds and integrality gap for vertex cover and many other problems in 
various hierarchies, see the related work section below. The focus of this paper is to prove lower 
bounds in a more general model. Specifically, our goal is to understand the strength of any 
polynomial-size linear programming relaxation of vertex cover independently of any hierarchy 
and irrespective of any complexity-theoretic assumption such as e.g., P 4- NP. 

We will rule out all possible polynomial-size LP relaxations obtained from adding an arbitrary 
set of valid inequalities of polynomial size. By "all possible LP relaxations", we mean that the 
variables of the LP can be chosen arbitrarily. They do not have to have to be the vertex-variables 
of (1.1). 
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Contribution 

We consider the general model of LP relaxations as in [13], see also [10]. Given an n-vertex graph 
G = (V, E), a system of linear inequalities Ax > b in R , where d 6 IN is arbitrary, defines an LP 
relaxation of vertex cover (on G) if the following conditions hold: 

Feasibility: For every vertex cover U c V, we have a feasible vector x u e lR d satisfying Ax u > b. 

Linear objective: For every vertex-costs c € R+, we have an affine function (degree-1 polynomial) 
f c : R d ^R. 

Consistency: For all vertex covers U c V and vertex-costs c € R+, the condition f c (x u ) - YjveU c v 
holds. 

For every vertex-costs c e R+, the LP min[/ c (x) | Ax > b\ provides a guess on the minimum cost of 
a vertex cover. This guess is always a lower bound on the optimum. 

We allow arbitrary computations for writing down the LP, and do not bound the size of the 
coefficients. We only care about the following two parameters and their relationship: the size of the 
LP relaxation, defined as the number of inequalities in Ax > b, and the (graph-specific) integrality 
gap which is the worst-case ratio over all vertex-costs between the true optimum and the guess 
provided by the LP, for this particular graph G and LP relaxation. 

This framework subsumes the polyhedral-pair approach in extended formulations [8]; see 
also [43]. We refer the interested reader to the surveys [15, 28] for an introduction to extended 
formulations; see also Section 4 for more details. 

In this paper, we prove the following result about LP relaxations of vertex cover and, as a 
byproduct, independent set . 1 2 

Theorem 1.1. For infinitely many values of n, there exists an n-vertex graph G such that: (i) Every 
size-n°( log,! / loglogM ) LP relaxation of vertex cover on G has integrality gap 2 - o(l); (ii) Every size- 
n o(iog n/ log log n) pp relaxation of independent set on G has integrality gap co( 1). 

This solves an open problem that was posed both by Singh [51] and Chan, Lee, Raghavendra 
and Steurer [13]. In fact, Singh conjectured that every compact (that is, polynomial size), symmetric 
extended formulation for vertex cover has integrality gap at least 2—e. We prove that his conjecture 
holds, even if asymmetric extended formulations are allowed. 

Our result for the independent set problem is even stronger than Theorem 1.1, as we are 
also able to rule out any polynomial size SDP with constant integrality gap for this problem. 
Furthermore, combining our proof strategy with more complex techniques we can prove a result 
similar to Theorem 1.1 for ^-Uniform-Vertex-Cover (that is, vertex cover in ^-uniform hypergraphs), 
for any fixed q > 2. For that problem, every size ;i"( log ' !,/|oglog ' ! ) LP relaxation has integrality gap 
q - o(l). This generalizes our result on (graph) vertex cover. 

In the general model of LP relaxations outlined above, the LPs are designed with the knowledge 
of the graph G = (V, E); this is a non-uniform model as the LP can depend on the graph. It captures 
the natural LP relaxations for vertex cover and independent set whose constraints depend on 
the graph structure. This is in contrast to previous lower bound results ([8, 11, 9]) on the LP 
formulation complexity of independent set, which are of a uniform nature: In those works, the 

1 Recall that an independent set (stable set) in graph G = (V, E) is a set of vertices I Q V such that no edge has both 
endpoints in I. independent set is the corresponding maximization problem: given a graph together with a weight for 
each vertex, find a maximum weight independent set. 

2 Note that in some cases imposing symmetry is a severe restriction, see Kaibel, Pashkovich and Theis [29]. 
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formulation of the LP relaxation was agnostic to the input graph and only allowed to depend on 
the number of vertices of the graph; see [10] for a discussion of uniformity vs. non-uniformity. 
In general non-uniform models are stronger (and so are lower bounds for it) and interestingly, 
this allows for stronger LP relaxations for independent set than NP-hardness would predict. This 
phenomenon is related to the approximability of problems with preprocessing. In Section 5, we 
observe that a result of Feige and Jozeph [21] implies that there exists a size-O(n) LP formulation 
for approximating independent set within a multiplicative factor of 0( y/n). 

Related work 

Most of the work on extended formulations is ultimately rooted in Yannakakis's famous paper [54, 
55] in which he proved that every symmetric extended formulation of the matching polytope 
and (hence) TSP polytope of the n-vertex complete graph has size 2°^. Yannakakis's work was 
motivated by approaches to proving P = NP by providing small (symmetric) LPs for the TSP, 
which he ruled out. 

The paper of Arora et al. [2, 3] revived Yannakakis's ideas in the context of hardness of ap¬ 
proximation and provided lower bounds for vertex cover in LS. It marked the starting point for 
a whole series of papers on approximations via hierarchies. Shortly after Arora et al. proved that 
performing 0(log n) rounds of LS does not decrease the integrality gap below 2, Schoenebeck, 
Trevisan and Tourlakis [48] proved that this also holds for o(n) rounds of LS. A similar result holds 
for the stronger Sherali-Adams (SA) hierarchy [49]: Charikar, Makarychev and Makarychev [14] 
showed that Q (n°) rounds of SA are necessary to decrease the integrality gap beyond 2 - e for 
some 5 - 5(e) > 0. 

Beyond linear programming hierarchies, there are also semidefinite programming (SDP) hi¬ 
erarchies, e.g., Lovasz-Schrijver (LS+) [39] and Sum-of-Squares/Lasserre [42, 35, 36]. Georgiou, 
Magen, Pitassi and Tourlakis [24] proved that 0( yj log «/ log log n) rounds of LS+ does not ap¬ 
proximate vertex cover within a factor better than 2. In this paper, we focus mostly on the LP 
case. 

Other papers in the "hierarchies" line of work include [16, 23, 47, 34, 44, 53, 30, 1, 6]. 

Although hierarchies are a powerful tool, they have their limitations. For instance, o(n) rounds 
of SA does not give an approximation of knapsack with a factor better than 2 [30]. However, for 
every e > 0, there exists a size-n 1 ' £+ °( 1 ) LP relaxation that approximates knapsack within a factor 
of 1 + e [7], 

Besides the study of hierarchy approaches, there was a distinct line of work inspired directly 
by Yannakakis's paper that sought to study the power of general (linear) extended formulations, 
independently of any hierarchy, see e.g., [45, 22,8,6,11,9, 46]. Limitations of semidefinite extended 
formulations were also studied recently, see [12, 38]. 

The lines of work on hierarchies and (general) extended formulations in the case of constraint 
satisfaction problems (CSPs) were merged in the work of Chan et al. [13]. Their main result 
states that for Max-CSPs, SA is best possible among all LP relaxations in the sense that if there 
exists a size-;Y LP relaxation approximating a given Max-CSP within factor a then performing 
2 r rounds of SA would also provide a factor-n approximation. They obtained several strong LP 
inapproximability results for Max-CSPs such as max cut and max 3-sat. This result was recently 
strengthened in a breakthrough by Lee, Raghavendra, and Steurer [38], who obtained analogous 
results showing (informally) that the Sum-of-Squares/Lasserre hierarchy is best possible among 
all SDP relaxations for Max-CSPs. 

Braun, Pokutta and Zink [10] developed a framework for proving size lower bounds on LP 
relaxations via reductions. Using [13] and FGLSS graphs [20], they obtained a H Q ( log U loglog ”) size 
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lower bound for approximating vertex cover within a factor of 1.5 - £ and independent set within 
a factor of 2 - e. Our paper improves these inapproximability factors to a tight 2 - £ and any 
constant, respectively. 

Outline 

The framework in Braun et al. [10] formalizes sufficient properties of reductions for preserving in¬ 
approximability with respect to extended formulations / LP relaxations; this reduction mechanism 
does not capture all known reductions due to certain linearity and independence requirements. 
Using this framework, they gave a reduction from max cut to vertex cover yielding the aforemen¬ 
tioned result. 

A natural approach for strengthening the hardness factor is to reduce from unique games 
instead of max cut (since vertex cover is known to be unique GAMES-hard to approximate within 
a factor 2 - £). However, one obstacle is that, in known reductions from unique games, the optimal 
value of the obtained vertex cover instance is not linearly related to the value of the unique games 
instance. This makes these reductions unsuitable for the framework in [10] (see Definition 4.3). 

We overcome this obstacle by designing a two-step reduction. In the first step (Section 3), we 
interpret the "one free bit" PCP test of Bansal and Khot [4] as a reduction from a unique games 
instance to a "one free bit" CSP (1F-CSP). We then use the family of SA integrality gap instances for 
the unique games problem constructed by Charikar et al. [14], to construct a similar family for this 
CSP. This, together with the main result of Chan et al. [13] applied to this particular CSP, implies 
that no size-n°( log ”/ loglog ”) LP relaxation can provide a constant factor approximation for 1F-CSP. 
In the second step (Section 4), a reduction from 1F-CSP to vertex cover, in the framework of Braun 
et al. [10], then yields our main result. 

Finally, following a slightly different and more challenging route we prove tight hardness of 
approximation for LP relaxations of c/-U n i form-Vertex-Cover for every q > 2. This is done in 
Section 6. 

2 Preliminaries 

We shall now present required tools and background. In Sections 2.1 and 2.2 we define the class 
of constraint satisfaction problems and the Sherali-Adams (SA) hierarchy, respectively. 

2.1 CONSTRAINT SATISFACTION PROBLEMS 

The class of constraint satisfaction problems (CSPs) captures a large variety of combinatorial 
problems, like max cut and max 3-sat. In general, we are given a collection of predicates V = 
[Pi,..., P m ] (or constraints C = [Ci,..., C m }) where each P, is of the form P ; - : [R]' J h-> [0,1], where 
[R] := [1, ...,R] is the domain and n is the number of variables. We will be mainly interested 
in the family of CSPs where each predicate P is associated with a set of distinct indices Sp = 
fa,..., 4} c [»] and is of constant arity k, i.e., P : [R] ,c [0,1}. In this terminology, for x 6 [R]” we 

set P(x) := P(t (| ,..., x H: ). The goal in such problems is to find an assignment for x € [R] n in such a 
way as to maximize the total fraction of satisfied predicates. 

The value of an assignment x € [R]' ! for a CSP instance I is defined as 

1 m 

Valj(x) := - V P,(x) = E [P(x)] f 
m peP 

1=1 
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and the optimal value of such instance I, denoted by OPT(J) is 

OPT(J) = max Valj(x). 

xe[R]" 

Often, we will consider binary CSPs, that is, with domain size R = 2. Given a binary predicate 
P : {0, l} k i—* {0,1}, the/fee bit complexity of P is defined to be log 2 (|{z 6 {0, l} k : P(z) = 1}|). For 
example the max cut predicate X; © Xj has a free bit complexity of one, since the only two accepting 
configurations are (x, = 0, Xj - 1) and (x, = 1, Xj = 0). 

For the LP-hardness of vertex cover and independent set (i.e.. Sections 3 and 4), we will be 
interested in a one free bit binary CSP, that we refer to as 1F-CSP, defined as follows: 

Definition 2.1 (1F-CSP). A IF-CSP instance ofarity k is a binary CSP over a set of variables {x\,.. .,x n } 
and a set of constraints C = {Ci,..., C m \ such that each constraint C 6 C is of arity k and has only 
two accepting configurations out of the 2 k possible ones. 

2.2 Sherali-Adams Hierarchy 

We define the canonical relaxation for constraint satisfaction problems as it is obtained by 
r-rounds of the Sherali-Adams (SA) hierarchy We follow the notation as in e.g., [25]. For com¬ 
pleteness we also describe in Appendix A why this relaxation is equivalent to the one obtained 
by applying the original definition of SA as a reformulation-linearization technique on a binary 
program. 

Consider any CSP defined over n variables X\,, x n E [R], with a set of m constraints C - 
{Ci ,... ,C m {where the arity of each constraint is at most k. Let S; = Sq denote the set of variables that 
Q depends on. The r-rounds SA relaxation of this CSP has a variable X(s,«) for each S C [;/], a E [R] s 
with |S| < r. The intuition is that X(s, a ) models the indicator variable whether the variables in S are 
assigned the values in a. The r-rounds SA relaxation with r ^ k is now 

1 m 

maX mTj Tj Cl{a) ' X ( s "“) 

*=1 ae [ R] s i 

s.t. X ( su{j\ AOU ) = X { s,a) vs C [n\ : |S| < r, a E [R] s , / £ [//] \ S , (2.1) 

»e[R] 

X (S ,a) >0 VS c [n] : |S| < r, a E [R] s , 

X (0,0) = 1 • 

Here we used the notation (S U {/}, a o u) to extend the assignment a to assign u to the variable 
indexed by j. Note that the first set of constraints say that the variables should indicate a consistent 
assignment. 

Instead of dealing with the constraints of the Sherali-Adams LP relaxation directly, it is simpler 
to view each solution of the Sherali-Adams LP as a consistent collection of local distributions over 
partial assignments. 

Suppose that for every set S c [n] with |S| < r, we are given a local distribution T)(S) over 
[R] s . We say that these distributions are consistent if for all S' c S c [n] with |S| < r, the marginal 
distribution induced on [R] s ' by D(S) coincides with that of D(S'). 

The equivalence between SA solutions and consistent collections of local distributions basically 
follows from the definition of (2.1) and is also used in [14] and [13] that are most relevant to our 
approach. More specifically, we have 
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Lemma 2.2 (Lemma 1 in [25]). If {2)(S)}sc[n] : |S|<r * s a consistent collection of local distributions then 


X (S,a) = P [a] 
' ’ £>(S) 


is a feasible solution to (2.1). 

Moreover, we have the other direction. 

Lemma 2.3. Consider a feasible solution (X(s, a ))sc[n]:|S|<r,ae[R] s to (2-1). For each S c [n\ with |S| < r, 
define 

P [a] = X(Sa) for each a € [P] s . 

£>(S) v ' 

T/zen (£)(S))sc[i!]:|S|<r forms a consistent collection of local distributions. 

Proof Note that, for each S c n with |S| < r, D(S) is indeed a distribution because by the equality 
constraints of (2.1) 

5^ %«) = x (S',«o= x (0,0) = 1 

ae[R] s 1 ae[R] s a'e[R] s ' 

where S' c S is arbitrary; and moreover Pi)(s)[n] = X(s (Ct ) > 0. Similarly we have, again by the 
equality constraints of (2.1), that for each S' c S and a' € \R] S ' 

lXS'[ a ^ = = ^ X (S,a'oa") = ^ ° a ] 

«"e[R]S\s' a"e[R]S\s' 

so the local distributions are consistent. □ 

When a SA solution (X(s, tt )) is viewed as consistent collection (D(S)} of local distributions, the 
value of the SA solution can be computed as 


1 m 

“E E c '(«)• x (Si,«) 

!=1 ae[R] s i 

where Sc is the support of constraint C. 


E P [a satisfies C] 

CeC a~0(S c ) 


3 Sherali-Adams Integrality Gap for 1F-CSP 

In this section we establish Sherali-Adams integrality gaps for 1F-CSP and by virtue of [13] this 
extends to general LPs. The proof uses the idea of [14] to perform a reduction between problems 
that preserves the Sherali-Adams integrality gap. 

Specifically, we show that the reduction by Bansal and Khot [4] from the unique games problem 
to 1F-CSP also provides a large Sherali-Adams integrality gap for 1F-CSP, assuming that we start 
with a Sherali-Adams integrality gap instance of unique games. As large Sherali-Adams integrality 
gap instances of unique games were given in [14], this implies the aforementioned integrality gap 
of 1F-CSP. 
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3.1 UNIQUE GAMES 

The unique games problem is defined as follows: 

Definition 3.1. A unique games instance TA = (G, [R], n) is defined by a graph G = (V, E) over a 
vertex set V and edge set E, where every edge uv e E is associated with a bijection map n u , v e IT 
such that 7 z U/V : [R] ha [R] (we set n V/U n~l 0 ). Here, [R] is known as the label set. The goal is 
to find a labeling A : V ha [R] that maximizes the number of satisfied edges, where an edge uv is 
satisfied by A if n U/V (A(u)) = A(u). 

The following very influential conjecture, known as the unique games conjecture, is due to 
Khot [31]. 

Conjecture 3.2. For any f,5 > 0, there exists a sufficiently large constant R = R(Q,5) such that the 
following promise problem is NP-hard. Given a unique games instance IE = (G, [R], n), distinguish 
between the following two cases: 

1. Completeness: There exists a labeling A that satisfies at least (1 - Q-fraction of the edges. 

2. Soundness: No labeling satisfies more than 5-fraction of the edges. 

We remark that the above conjecture has several equivalent formulations via fairly standard 
transformations. In particular, one can assume that the graph G is bipartite and regular [32], 

The starting point of our reduction is the following Sherali-Adams integrality gap instances 
for the unique games problem. Note that unique games are constraint satisfaction problems 
and hence here and in the following, we are concerned with the standard application of the 
Sherali-Adams hierarchy to CSPs. 

Theorem 3.3 ([14]). Fix a label size R = 2 C , a real 5 e (0,1) and let A := [C(R/6) 2 ] ( for a sufficiently 
large constant C). Then for every positive e there exists k > 0 depending on e such that for infinitely many 
n there exists an instance of unique games on a A-regular n-vertex graph G = (V, E) so that: 

1. The value of the optimal solution is at most ^ ■ (1 + 6). 

2. There exists a solution to the LP relaxation obtained after r = n K rounds of the Sherali-Adams 
relaxation of value 1 - e. 

3.2 Reduction from unique games to 1F-CSP 

We first describe the reduction from unique games to 1F-CSP that follows the construction in [4]. 
We then show that it also preserves the Sherali-Adams integrality gap. 

Reduction. Let TA = (G, [R], n) be a unique games instance over a regular bipartite graph G = 
(V) W, E). Given TA, we construct an instance I of 1F-CSP. The reduction has two parameters: 
5 > 0 and e > 0, where e is chosen such that eR is an integer (taking e = 2 _ 'i for some integer q > 0 
guarantees this). We then select t to be a large integer depending on e and 5. 

The resulting 1F-CSP instance I will be defined over 2 R | W| variables and c\ V\ constraints, where 
c := c{R, e, t, A) is a function of the degree A of the unique games instance, and the constants R, t and 
£. 3 For our purposes, the unique games integrality gap instance that we start from has constant 
degree A, and hence c is a constant. 

3 More precisely c(R, e, t, A) is exponential in the constants R, t and e, and polynomial in A 
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Before we proceed, we stress the fact that our reduction is essentially the same as the one free 
bit test F Er t in [4], but casted in the language of constraint satisfaction problems. The test F £ft 
expects a labeling A : W h [P] for the vertices of the unique games instance, where each label 
A(iv) £ [R] is encoded using a 2 R bit string. To check the validity of this labeling, the verifier picks 
a vertex v £ V uniformly at random, and a sequence of t neighbors w-\ ,..., w t of v randomly and 
independently from the neighborhood of v, and asks the provers about the labels of {iv\,... ,iv t } 
under the labeling A. It then accepts if the answers of the provers were convincing, i.e., the 
labels assigned to {rtq,..., zv t ] satisfy the edges vw \,..., vw t simultaneously under n V/Wl ,..., n V/Wt 
respectively. 

Instead of reading all of the t2 R bits corresponding to the t labels, the verifier only reads a 
random subset of roughly t2 £R bits and is able to accept with high probability if the labeling was 
correct, and to reject with high probability if it was not correct. In our reduction, the variables of 
the 1F-CSP instance I corresponds to the 2 R bits encoding the labels of each vertex of the unique 
games instance we start from 4 , and the constraints corresponds to all possible tests that the verifier 
might perform according to the random choice of v, the random neighbors and the 

random subset of bits read by the verifier. Instead of actually enumerating all possible constraints, 
we give a distribution of constraints which is the same as the distribution over the test predicates 
of F £/ f. 

We refer to the variables of I as follows: it has a binary variable ( w , x ) for each w £ W and 
x £ {0,1} R . 5 For further reference, we let Var(X) denote the set of variables of I. The constraints of 
I are picked according to the distribution in Figure 1. 

It is crucial to observe that our distribution over the constraints exploits the locality of a unique 
games solution. To see this, assume we performed the first two steps of Figure 1 and have thus 
far fixed a vertex v £ V and t neighbors W\,... ,Wt, and let C v ,wi,...,w t denote the set of all possible 
constraints resulting from steps 3-4 (i.e., for all possible x £ {0,1} R and S C [R] of size ::R). We 
will argue that if there exists a local assignment of labels for {v, iv \,..., w t \ that satisfies the edges 
vvo\,..., vwt, then we can derive a local assignment for the variables {( w , x) : w £ {ay,..., w t ] and x £ 
{0,1} R } that satisfies at least 1 - £ fraction of the constraints in C V/Wu ...,w r This essentially follows 
from the completeness analysis of [4], and is formalized in Claim 3.6. This allows us to convert a 
good Sherali-Adams solution of the starting unique games IF, to a good Sherali-Adams solution of 
the resulting 1F-CSP Instance I. Moreover, in order to show that I is a Sherali-Adams integrality 
gap instance for the 1F-CSP problem, we need to show that OPT(X) is small. This follows from the 
soundness analysis of [4], where it was shown that: 

Lemma 3.4 (soundness). For any e, q > 0 there exists an integer t so that OPT(J) < 7 ] if OPT (flF) < 5 
where 6 > 0 is a constant that only depends on e, 77 and t. 

The above says that if we start with a unique games instance IF with a small optimum then we 
also get a 1F-CSP instance I of small optimum (assuming that the parameters of the reduction are 
set correctly). In [4], Bansal and Khot also proved the following completeness: if OPT (flF) > 1 — C, 
then OPT(T) > 1 - Ct — e. However, we need the stronger statement: if FF has a Sherali-Adams 
solution of large value, then so does I. The following lemma states this more formally, showing 
that we can transform a SA solution to the unique games instance IF into a SA solution to the 
1F-CSP instance I of roughly the same value. 

4 For the reader familiar with hardness of approximation and PCP based hardness, we are using the long code to 
encode labels, so that each of these 2 R bits gives the value of the dictator function / evaluated on a different binary 
string x e {0,1) R ; for a valid encoding we have f(x) = x £ where {is the label that is encoded. 

5 (w,x) should be interpreted as the long-code for A (w) evaluated at x e {0,1) R . 
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1. Pick a vertex v e V uniformly at random. 

2. Pick t vertices W\,... ,Wt randomly and independently from the neighborhood N(v) = 
[w € W : vw € E}. 

3. Pick x e {0,1} R at random. 

4. Let m = eR. Pick indices randomly and independently from [R] and let 

S = {z'i, ..., i m ] be the set of those indices. 

5. Define the sub-cubes: 

C x ,s = {ze {0,1} R : Zj = xj V; t S} 

Cx,s = {ze {0,1} R : z, = Xj V; t S} 

6. Output the constraint on the variables {< Wj,z ) | i € [t], n~^ w .(z) e C X/ s U C x ,s} that is true if 
for some bit b e {0,1} we have 

(iVj, z) = b for all i e [f] and n~J w .(z) € C Xr s, and 

(u>i,z) =b® 1 for all i e [f] and n~j Vi (z) e C X/ s 

where 7i(z) for z 6 {0, 1} R is defined as 7i(z) := (z n (i), z n ( 2 ), ■ ■ ■, z n (R)), and 7i~ ] is the inverse 
map, i.e., n ' (z) e C X/ s is equivalent to saying that there exists y 6 C X/ s such that 7z(y) = z. 


Figure 1: Distribution for the 1F-CSP constraints 


Lemma 3.5. Let {y(S) | S c V U W, |S| < r\ be a consistent collection of local distributions defining a 
solution to the r-rounds Sherali-Adams relaxation of the regular bipartite unique games instance 'll. Then 
we can define a consistent collection of local distributions {<j(S) | S c Var(I), |S| < r} defining a solution to 
the r-rounds Sherali-Adams relaxation of the 1F-CSP instance I so that 


E P [a satisfies C] 
CsC a~a(Sc) 


>(l-e) 1-f 


E 

vweE 


P [A(y) + ^^(A^))] 

( A(v), A(w) ~ f / (( v,iv )) 


where t and e are the parameters of the reduction, and o(Sc) is the distribution over the set of variables in 
the support Sc of constraint C. 


We remark that the above lemma says that we can transform a SA solution to the unique games 
instance TT of value close to 1, into a SA solution to the 1F-CSP instance I of value also close to 1. 

Proof of Lemma 3.5. Let {y(S) | S c V U W, |S| < rj be a solution to the r-rounds SA relaxation 
of the unique games instance U, and recall that I is the 1F-CSP instance obtained from applying 
the reduction. We will now use the collection of consistent local distributions of the unique games 
instance, to construct another collection of consistent local distributions for the variables in Var(X). 

For every set S c Var(J) such that |S| < r, let T$ Q W be the subset of vertices in the unique 
games instance defined as follows: 


T s = {weW:(w,x)eS}. (3.1) 

We construct a(S) from p(Tg) in the following manner. Given a labeling A j s for the vertices in 
Ts drawn from y(Ts), define an assignment as for the variables in S as follows: for a variable 
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(w,x) € S, let € = Aj s (w) be the label of w according to Aj s . Then the new assignment as 
sets cts((w,x)) := 17 . 6 The aforementioned procedure defines a family {a(S)}scvar(J):|S|<r of local 
distributions for the variables of the 1F-CSP instance I. 

To check that these local distributions are consistent, take any S' C S C Var (I) with |S| < r, 
and denote by T$> C T$ their corresponding set of vertices as in (3.1). We know that p(T$) and 
/i(Ts') agree on T$> since the distributions j/i(S)} defines a feasible Sherali-Adams solution for tl, 
and hence by our construction, the local distributions a(S) and a(S') agree on S'. Combining all 
of these together, we get that (a(S) | S c Var(J), |S| < r) defines a feasible solution for the r-round 
Sherali-Adams relaxation of the 1F-CSP instance I. 

It remains to bound the value of this feasible solution, i.e.. 


E P [a satisfies C] 

CeC a~a(Sc) 


(3.2) 


In what follows, we denote by t/>(.) the operator mapping a labeling of the vertices in T$ to an 
assignment for the variables in S, i.e., ip(Aj s ) = a$- 

First note that a constraint C 6 C of the 1F-CSP instance I is defined by the choice of the vertex 
v 6 V, the sequence of t neighbors TV = (w\,..., w t ), the random x 6 { 0 , 1 } R , and the random set 
S c [P] of size c R. We refer to such a constraint C as C(v, TV-,, x, S). Thus we can rewrite (3.2) as 


E 


P [</ ; (^) satisfies C(v, TV IV x, S)1 


(3.3) 


Recall that the assignment i/'(A) for the variables j (w,z) : w € TV Z , and z € {0,1} R J is derived from 
the labeling of the vertices in AV v according to A. It was shown in [4] that if A satisfies the 
edges vw \,..., vw t simultaneously, then ip( A) satisfies C{v, 'W v , x, S) with high probability. This is 
formalized in Claim 3.6, whose proof appears in Appendix B. 

Claim 3.6. If A satisfies vw\,..., vw t simultaneously, then ip(A) satisfies C(v, TV,, x, S ) with prob¬ 
ability at least 1 - e. Moreover, if we additionally have that A(v) i S, then xp( A) always satisfies 
C(v, AV v ,x, S). 

It now follows from Claim 3.6 that for the assignment ip(A) to satisfy the constraint 
C(v, TV-,, x, S ), it is sufficient that the following two conditions hold simultaneously. 

1. the labeling A satisfies the edges vw\,..., vw t ; 

2. the label of v according to A lies outside the set S. 

Equipped with this, we can use conditioning to lower-bound the probability inside the expec¬ 
tation in (3.3) by a product of two probabilities, where the first is 


P [i/>(A) satisfies C(v, TV, x, S)|A satisfies vw \,..., vw t ] (3.4) 

A~ fi ( [v,W\,.. ,,wt ) ),x,S 


and the second is 


P [A satisfies vw \,..., vw t ]. 

A~ n({v,Wi,...,Wt}) 

6 Because (w , x ) is supposed to be the dictator function of the Th coordinate evaluated at x, this is only the correct 
way to set the bit (w, x). 
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Thus using Claim 3.6, we get 


E 

P [a satisfies C] 

> (1 - e) ■ E 

P [A satisfies vw \,.. .,vw t ] 

CeC 

a~o{S c ) 

V,Wi,... r Wt 



( t 






(l- £ ) 1 -Y E 
V i=1 


( t 


= (1 - e) 


i - y E 

V.W-i ... 


i =1 




P [A does not satisfy vzvf 


P [A does not satisfy vzvj] 

A~fx({ v,Wi}) 


(1 - e) ■ 1 - t ■ E 

V,W 


P [A does not satisfy iro] 


(3.5) 

(3.6) 

(3.7) 


where (3.5) follows from the union bound, and (3.6) is due to the fact that the local distributions of 
the unique games labeling are consistent, and hence agree on {v,Wj}. Note that the only difference 
between what we have proved thus far and the statement of the lemma, is that the expectation 
in (3.7) is taken over a random vertex v and a random vertex w £ N(v), and not random edges. 
However, our unique games instance we start from is regular, so picking a vertex v at random 
and then a random neighbor w £ N(v), is equivalent to picking an edge at random from E. This 
concludes the proof. □ 

Combining Theorem 3.3 with Lemmata 3.4 and 3.5, we get the following Corollary. 

Corollary 3.7. For every e, r\ > 0, there exist an arity k and a real k > 0 depending on e and rj such that 
for infinitely many n there exists an instance of 1F-CSP of arity k over n variables, so that 

1. The value of the optimal solution is at most r\. 

2. There exists a solution to the LP relaxation obtained after r = n K rounds of the Sherali-Adams 
relaxation of value at least 1 - e. 

Proof. Let EE - (G, [R], n) be a A-regular unique games instance of Theorem 3.3 that is 6/4-satisfied 
with an n^f -rounds Sherali-Adams solution of value 1 — C, where hq is the number of vertices in 
G. Note that G = (V, E ) is not necessarily bipartite, and our starting instance of the reduction is 
bipartite. To circumvent this obstacle, we construct a new bipartite unique games instance EE' from 
EE that is 6 -satisfied with a Sherali-Adams solution of the same value, i.e., 1 - £■ We will later use 
this new instance to construct our 1F-CSP instance over n variables that satisfies the properties in 
the statement of the corollary. 

In what follows we think of <5, Q and R as functions of £ and i], and hence fixing the latter two 
parameters enables us to fix the constant t of Lemma 3.4, and the constant degree A of Theorem 
3.3. The aforementioned parameters are then sufficient to provide us with the constant arity k of 
the 1F-CSP instance, along with the number of its corresponding variables and constraints, that is 
linear in Uq. 

We now construct the new unique games instance TE' over a graph G' = {V\, W, £') and the 
label set [R] from EE in the following manner: 

- Each vertex v € V in the original graph is represented by two vertices V\, Vi, such that V\ £ Pi 
and V 2 £ W- 


- Each edge e = uv £ E is represented by two edges e\ = it \V 2 and t ’2 = 112 N in E'. The bijection 
maps n U] /V2 and n U2iVl are the same as n U/V . 
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Note that G' is bipartite by construction, and since G is A-regular, we get that G' is also A-regular. 

We claim that no labeling A' : V\ U V 2 [R] can satisfy more than 6 fraction of the edges in tl '. 
Indeed, assume towards contradiction that there exists a labeling A' : V\ U V 2 i-» [R] that satisfies 
at least 6 fraction of the edges. We will derive a labeling A : V 1 —> [R] that satisfies at least 6/4 
fraction of the edges in tl as follows: 

For every vertex v e V, let v\ e V\ and V 2 £ V 2 be its representative vertices in G' . Define 
A(u) to be either A'(ui) or A' (V 2 ) with equal probability. 

Assume that at least one edge of e\ = u \ V 2 and £’2 = iiiV\ is satisfied by A', then the edge e = uv e E 
is satisfied with probability at least 1/8, and hence the expected fraction of satisfied edges in tl by 
A is at least 6/4. 

Moreover, we can extend the r-rounds Sherali-Adams solution of tl {£)(S)}scv : |S|<r/ to a r-rounds 
Sherali-Adams solution {!)' (S)}sci/|UU 2 :|S|-sr for 1A' with the same value. This can be done as follows: 
For every set S = Si U S 2 c V\ U V 2 of size at most r, let S<u c V be the set of their corresponding 
vertices in G and define the local distribution D'(S) by mimicking the local distribution D(S^), 
repeating labels if the same vertex v e S<u has its two copies V\ and V 2 in S. 

Now let I be the 1F-CSP instance over n variables obtained by our reduction from the unique 
games instance 1A’ , where n - 2 r uq. Since OPT(TF') < 6, we get from Lemma 3.4 that OPT(X) < 17 . 
Similarly, we know from Lemma 3.5 that using an n 2 * -rounds Sherali-Adams solution for tl' , we 
can define an u'"-rounds Sherali-Adams solution of I of roughly the same value, where we used 
the fact that R is a constant and hence ^2~ 2RK n 2K ^j > n K for sufficiently large values of tl. This 
concludes the proof. □ 

We have thus far proved that the 1F-CSP problem fools the Sherali-Adams relaxation even after 
n K many rounds for some constant 1 > k > 0 . 

4 LP-hardness of vertex cover and independent set 
4.1 Reduction of LP relaxations 

We will now briefly introduce a formal framework for reducing between problems that is a stripped 
down version of the framework due to Braun et al, with a few notational changes. The interested 
reader can read the details of the full original framework in [10]. 

We start with the definition of an optimization problem. 

Definition 4.1. An optimization problem n = (S, 3) consists of a (finite) set S of feasible solutions and 
a set 3 of instances. Each instance I 6 3 specifies an objective function from S to 1R+. We will denote 
this objective function by Valj for maximization problems, and Cost/ for minimization problems. 
We let OPT(l~) := max S6l s Valj(S) for a maximization problem and OPT(l~) := min s& s Costj(S) for 
a minimization problem. 

With this in mind we can give a general definition of the notion of an LP relaxation of an 
optimization problem IT. We deal with minimization problems first. 

Definition 4.2. Let p > 1. A factor-p LP relaxation (or p-approximate LP relaxation) for a minimization 
problem n = (S, 3) is a linear system Ax > b with .r 6 \R d together with the following realizations: 

(i) Feasible solutions as vectors x s e lR rf for every S e S so that 

Ax s > b for all S e S 
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(ii) Objective functions via affine functions fj : M. d —> R for every I € 3 such that 

fi(x s ) = Costj(S) for all S e S 

(iii) Achieving approximation guarantee p via requiring 

OPT(J) < p LP(J) for all I e 3 

where LP(i~) := min \fj(x) \ Ax > b}. 

Similarly, one can define factor-p LP relaxations of a maximization problem for p > 1. In our 
context, the concept of a (c, s)-approximate LP relaxation will turn out to be most useful. Here, 
c is the completeness and s < c is the soundness. For a maximization problem, this corresponds to 
replacing condition (iii) above with 

(iii)' Achieving approximation guarantee (c, s) via requiring 

OPT(J) < s => LP (J) < c for all I 6 3. 

The size of an LP relaxation is the number of inequalities in Ax > b. We let fc + (n, p) denote the 
minimum size of a factor-p LP relaxation for n. In the terminology of [10], this is the p-approximate 
LP formulation complexity of n. We define fc+(n, c, s) similarly 

In this framework problems can be naturally reduced to each other. We will use the following 
restricted form of reductions. 

Definition 4.3. Let Hi = (d>i, 3i) be a maximization problem and H 2 = (d> 2 , 32 ) be a minimization 
problem. A reduction from ni to YI 2 consists of two maps, one I \ 1 —> 1 2 from 3i to 32 and the other 
Si S 2 from to S 2 , subject to 


Valjj(Si) - pj, - Cjj • Cost j2 (S 2 ) I\ £ 3i,Si € d>i 

where pj 1 is called the affine shift and C j > 0 is a normalization factor. 

We say that the reduction is exact if additionally 

OPT( [h ) = p Xl - Cj, • OPT (1 2 ) 1 1 € 3i . 

The following result is a special case of a more general result by [10]. We give a proof for 
completeness. 

Theorem 4.4. Let ni be a maximization problem and let H 2 be a minimization problem. Suppose that there 
exists an exact reduction from Hi to n 2 with p := pj 1 constant for all I\ 6 3i. Then, fc+{Tli,Ci,si) < 
fc + (n 2 ,p 2 ) where p 2 '■= ^7 (assuming p> c\> Si). 

Proof. Let Ax > b by a p 2 -approximate LP relaxation for n 2 = (d> 2 ,32), with realizations x - for 
S 2 £ d >2 and fj 2 : ]R rf —> R for 1 2 £ 32- We use the same system Ax > b to define a (ci,Si)- 
approximate LP relaxation of the same size for Hi = (d>i,32) by letting x Sl := x Si where S 2 is the 
solution of n 2 corresponding to Si £ d>| via the reduction, and similarly fj } := p - Qj 1 fi 2 with 
Cjj > 0 where J 2 is the instance of n 2 to which I\ is mapped by the reduction and p is the affine 
shift independent of the instance I\. 
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Then conditions (i) and (ii) of Definition 4.3 are automatically satisfied. It suffices to check (in)' 
with our choice of p 2 , for the given completeness C\ and soundness Sj. Assume that OPT(Ji) < Si 
for some instance I\ of Tli. Then 

LP(Ji) = p - Cjj LP(J 2 ) (by definition of , and since Cj, > 0) 

< H - — ■ Q Tl • OPT(J 2 ) (since OPT(J 2 ) < p 2 LP(J 2 )) 

P 2 

p - C\ 

- p 4-— • (OPT(Ji) -p) (since the reduction is exact) 

<si 

p - Cl 

< P + --(Si - p) 

p - Si 

= Cl, 

as required. Thus Ax > b gives a (ci, Si)-approximate LP relaxation of IIi. The theorem follows. □ 

We will also derive inapproximability of independent set from a reduction between maximiza¬ 
tion problems. In this case the inapproximability factor obtained is of the form p 2 = jyprj-- 

4.2 Hardness for vertex cover and independent set 

We will now reduce 1F-CSP to vertex cover with the reduction mechanism outlined in the previous 
section, which will yield the desired LP hardness for the latter problem. 

We start by recasting vertex cover, independent set and 1F-CSP in our language. The two 
first problems are defined on a fixed graph G = (V, E). 

Problem 4.5 (vertex cover(G)). The set of feasible solutions S consists of all possible vertex covers 
(icy, and there is one instance I - 1(H) 6 3 for each induced subgraph H of G. For each vertex 
cover U we have Costj(H)(H) := |U (T V(H)\ being the size of the induced vertex cover in H. 

Note that the instances we consider have 0/1 costs, which makes our final result stronger: even 
restricting to 0/1 costs does not make it easier for LPs to approximate vertex cover. Similarly, for 
the independent set problem we have: 

Problem 4.6 (independent set(G)). The set of feasible solutions S consists of all possible indepen¬ 
dent sets of G, and there is one instance I = 1(H) e 3 for each induced subgraph H of G. For each 
independent set I e S, we have that Valj(H)(f) := |In V(H)| is the size of the induced independent 
set of H. 

Finally, we can recast 1F-CSP as follows. Let n,k e N be fixed, with k < n. 

Problem 4.7 (lF-CSP(n, k)). The set of feasible solutions S consists of all possible variable assign¬ 
ments, i.e., the vertices of the »-dimensional 0/1 hypercube and there is one instance I - I(P) for 
each possible set P = [P],, P m ] of one free bit predicates of arity k. As before, for an instance 
I e 3 and an assignment x 6 {0, lj n , Valj(x) is the fraction of predicates Pj that x satisfies (see 
Definition 2.1). 

With the notion of LP relaxations and 1F-CSP from above we can now formulate LP-hardness 
of approximation for lF-CSPs, which follows directly from Corollary 3.7 by the result of [13]. 
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Theorem 4.8. For every e > 0 there exists a constant arity k = k(e) such that for infinitely many n we 
have fc+(lF-CSP(n, k), 1 - e,e) > n Q ( lo S”/ |o s lo s»). 

Following the approach in [10], we define a graph G over which we consider vertex cover, 
which will correspond to our (family of) hard instances. This graph is a universal FGLSS graph as 
it encodes all possible choices of predicates simultaneously [19]. The constructed graph is similar 
to the one in [10], however now we consider all one free bit predicates and not just the max cut 
predicate x © y. 

Definition 4.9 (vertex cover host graph). For fixed number of variables n and arity k < n we 
define a graph G* = G*(n, k) as follows. Let x\,... ,x n denote the variables of the CSP. 

Vertices: For every one free bit predicate P of arity k and subset of indices S c [n] of size k 
we have two vertices Vp r s ,i and Vp r s r 2 corresponding to the two satisfying partial assignments for P 
on variables x, with i € S. For simplicity we identify the partial assignments with the respective 
vertices in G*. Thus a partial assignment a 6 [0,1} S satisfying predicate P has a corresponding 
vertex vp A e {v Pi s,i, v P iSi2 } ■ 

Edges: Two vertices Vp Al and Vp A2 are connected if and only if the corresponding partial 
assignments a\ and ai are incompatible, i.e., there exists i e Si n S 2 with a\(i) F (Xif)- 

Note that the graph has 2( 2 2 )('f) vertices, which is polynomial in n for fixed k. In order to 
establish LP-inapproximability of vertex cover and independent set it now suffices to define a 
reduction satisfying Theorem 4.4. 

Main Theorem 4.10. For every e > 0 and for infinitely many n, there exists a graph G with | V(G)| = n 
such that fc+ (vertex cover(G), 2 - e) > n n ^°& n l lo & lo & n \ and also fc+ (independent set(G), 1/e) > 

n Q(logn/loglogn)_ 

Proof. We reduce 1F-CSP on n variables with sufficiently large arity k = k(c) to vertex cover over 
G := G*(n, k). For a 1F-CSP instance I\ := I\lfP) and set of predicates P = [Pi,..., P m }, let H(P) 
be the induced subgraph of G on the set of vertices V(P) corresponding to the partial assignments 
satisfying some constraint in P. So V(P) = {vp^j \ P eP,S c [n], |S| < k, i = 1,2}. 

In Theorem 4.8 we have shown that no LP of size at most «°( log log log ") can provide an (1 - e, e)- 
approximation for 1F-CSP for any e > 0, provided the arity k is large enough. To prove that every 
LP relaxation with 2 - £ approximation guarantee for vertex cover has size at least w Q ( lo g”/ lo 8 lo g ”), 
we provide maps defining a reduction from 1F-CSP to vertex cover. 

In the following, let Lb = (2>i,3i) be the 1F-CSP problem and let FI 2 = (S 2 , Lp) be the vertex 
cover problem. In view of Definition 4.3, we map I] - I\(P) to I 2 = IiiFliP)) and let p := 2 and 
Cjj := y where m is the number of constraints in P. 

For a total assignment x 6 d>i we define U = U (x) := {vp A \ a satisfies P and x does not 
extend a}. The latter is indeed a vertex cover: we only have edges between conflicting partial 
assignments, and all the partial assignments that agree with x are compatible with each other. 
Thus I = l(x) := {vp A | a satisfies P and x extends a} is an independent set and its complement 1 1 is 
a vertex cover. 

We first verify the condition that Valj^x) = 2 - ^Costj 2 (lf(x)) for all instances I\ 6 3i and 
assignments x 6 5]. Every predicate P in P over the variables in jx, | i 6 S] has exactly two 
representative vertices Vp Al , Vp A2 where the ct\, «2 £ [0,1} S are the two partial assignments satisfying 
P. If an assignment x 6 2>i satisfies the predicate P, then exactly one of a\, a 2 is compatible with x. 
Otherwise, when P(x) = 0, neither of a.\,ai do. This means that in the former case exactly one of 
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Vp Al ,Vp A2 is contained in U and in the latter both Vp Al and Vp A2 are contained in U. It follows that 
for any I\ - I\(P) E 3i and x E S\ it holds 

Valjj(x) = 2 - ^Cost l2 (U(x)). 

In other words, for any specific P the affine shift is 2, and the normalization factor is 2.. 

Next we verify exactness of the reduction, i.e., 

OPT(id) = 2 - - OPT(J 2 ). 
m 

For this take an arbitrary vertex cover U E S 2 of G and consider its complement. This is an 
independent set, say I. As I is an independent set, all partial assignments a such that Vp A E I 
are compatible and there exists a total assignment x that is compatible with each a with Vp /Cl E I. 
Then the corresponding vertex cover U(x) is contained in U. Thus there always exists an optimum 
solution to 1 2 that is of the form U(x). Therefore, the reduction is exact. 

It remains to compute the inapproximability factor via Theorem 4.4. We have 


P 2 = 


2-e 


2 ~ (1 - e) 


>2-3 £ 


A similar reduction works for independent set. This time, the affine shift is p = 0 and we get 
an inapproximability factor of 

1 - e 1 
P 2 - “ 


> 2e 


for e small enough. 


□ 


5 Upper bounds 

Here we give a size-O(n) LP relaxation for approximating independent set within a factor-0( xfn), 
which follows directly by work of Feige and Jozeph [21]. Note that this is strictly better than the 
n l ~ £ hardness obtained assuming P 4- NP by [26]. This is possible because the construction of our 
LP is NP-hard while being still of small size, which is allowed in our framework. 

Start with a greedy coloring of G = (V, £): let l\ be any maximum size independent set of G, 
let h be any maximum independent set of G - /(, and so on. In general, Ij+\ is any maximum 

independent set of G - h -fy Stop as soon as h U • • • U Ij covers the whole vertex set. Let k < n 

denote the number of independent sets constructed, that is, the number of colors in the greedy 
coloring. 

Feige and Jozeph [21] made the following observation: 

Lemma 5.1. Every independent set I of G has a nonempty intersection with at most [2 x/nj of the color 
classes Ij. 

Now consider the following linear constraints in [R l/ X IR /c - R n+k : 

0 < x v < yj < 1 V; £ [k\, v E Ij (5.1) 

k 

< L2 VwJ - (5.2) 

7=1 
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These constraints describe the feasible set of our LP for independent set on G. Each independent 
set I of G is realized by a 0/1-vector ( x 1 , y 1 ) defined by xj, = 1 iff I contains vertex v and y l . = 1 
iff I has a nonempty intersection with color class Ij. For an induced subgraph H of G, we let 
fl(H)(x,y) := YjveV(H) x v- By Lemma 5.1, (x 1 , if) satisfies (5.1)-(5.2). Moreover, we clearly have 
fl(H)( xl ,y') = \InV(H)\. Let LP(J(H)) := max{f I{H) (x,y) | (5.1), (5.2)} = ma x{LveV(H) x v I (5.1), (5.2)}. 
Lemma 5.2. For every induced subgraph H of G, we have 

LP (1(H)) < [2 yJTi\ OPT(JCK)). 


Proof. When solving the LP, we may assume x v = yj for all j e [k] and all v e Ij. 
be rewritten 


max 


\Ij n V(H)\ ■ yy | 0 < yj 



;=i 


7 = 1 


Thus the LP can 


Because the feasible set is a 0/1-polytope, we see that the optimum value of this LP is attained by 
letting yj = 1 for at most [2 xfn\ of the color classes Ij and if j = 0 for the others. Thus some color 
class Ij has weight at least 1/|_2 yfn\ of the LP value. □ 


By Lemma 5.2, constraints (5.1)-(5.2) provide a size-O(n) factor -O(xfn) LP relaxation of inde¬ 
pendent SET. 

Theorem 5.3. For every n-vertex graph G, /^(independent set(G),2 xfn) < O(n). 

Although the LP relaxation (5.1)-(5.2) is NP-hard to construct, it is allowed by our framework 
because we do not bound the time needed to construct the LP. To our knowledge, this is the first 
example of a polynomial-size extended formulation outperforming polynomial-time algorithms. 

We point out that a factor -)/ 1 LP-inapproximability of independent set holds in a different 
model, known as the uniform model [11, 9]. In that model, we seek an LP relaxation that approxi¬ 
mates all independent set instances with the same number of vertices n. This roughly corresponds 
to solving independent set by approximating the correlation polytope in some way, which turns 
out to be strictly harder than approximating the stable set polytope, as shown by our result above. 


6 LP Hardness for ^-Uniform-Vertex-Cover 

In order to prove LP lower bounds for vertex cover and independent set in Sections 3 and 4, we 
first started by providing a reduction from the unique games problem to the 1F-CSP problem, that 
implied that no small size linear program is a (1 - e, ^-approximation for the 1F-CSP problem. We 
then gave a gap-preserving reduction from any LP approximating 1F-CSP to any LP approximating 
vertex cover, and showed that no small size LP can provide a (2 - ^-approximation for the vertex 
cover problem. 

Our approach for the <j-Uniform-Vertex-Cover will be similar, however our starting point 
is a constraint satisfaction problem different than 1F-CSP. This new constraint satisfaction 
problem, that we refer to as Not-Equal-CSP, is defined as follows: 

Definition 6.1. A CSP of arity k over the domain is referred to as Not-Equal-CSP if each 
constraint P : Zj) —> {0,1} is of the following form 

k 

P A (xi,X 2 , ...,Xic) = 1 if and only if /\( Xi ^ fl i) 

i=i 

7 For convenience, we use the additive group Z, = {0,... , q - 1) instead of [q] as the domain of our CSP. 
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for some A = (a\, ci 2 , ■ ■ ■, af) £ Z|j. When x € Z”, for some n > k, a predicate P := Ps,a is additionally 
indexed by a set S = [z'i, z 2/ ... ,4} £ [n], and Ps,a( x ) = PaCv V 2 , . .., x ik ). 

We remark that the above definition should not be confused with the common Not-AZl-Equal 
predicate. 

Similar to the approach of vertex cover, we shall prove that there is no small linear program¬ 
ming relaxation for ^-Uniform-Vertex-Cover with a good approximation guarantee in two steps. 
In the first step, we prove that no small linear programming relaxation can approximate well the 
Not-Equal-CSP problem. We then give a gap-preserving reduction from this problem to that of 
zj-Untform-Vertex-Cover in the framework of [10]. 

6.1 Sherali-Adams Integrality Gap for Not-Equal-CSP 

This section will be dedicated to proving the following theorem. 

Theorem 6.2. For any e > 0 and integer q > 2, there exist k > 0 and an integer k so that for infinitely 
many n there exists a Not-Equal-CSP instance I ofarity k over n variables satisfying 

- OPT(J) < e; 

— There is a solution to the n K -round Sherali-Adams relaxation of value 1 - 1/q - e. 

The above theorem states that the Not-Equal-CSP problem can fool the Sherali-Adams re¬ 
laxation even after n K many rounds. Before we proceed, we discuss functions of the form 
/ : Z^ i—> {0,1}. These functions will play a crucial role in the analysis. 

6.1.1 Functions Over the Domain Z^ 

In order to construct Sherali-Adams integrality gaps for the Not-Equal-CSP problem, we also 
reduce from the unique games problem. The analysis of this reduction relies heavily on known 
properties regarding functions of the form / : Z^ {0,1}, where Z q is to be thought of as the 
domain of the new CSP, and R as the label set size of the unique games instance. More precisely, 
we exploit the drastic difference in the behavior of functions depending on whether they have 
influential coordinates or not. To quantify these differences, we first need the following definitions. 

Definition 6.3. For a function / : Z^ i-> [0,1], and an index i <E [R], the influence of the 7-th 
coordinate is given by 


Inf iff) = E [Var [f(x) |*i , . . ., Xi—\,Xi+ 1,..., x n J] 


where x n are uniformly distributed. 


An alternative definition for the influence requires defining the Fourier expansion of a function 
/ of the form / : Z ^ h-> {0,1}. To do this, let <po = 1, (pi, ■■■, (pq-i : Z, ? h-> R be such that for all 
i, j £ [q], we have 



Jo if it j 
|l «i = i 
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where the expectation is taken over the uniform distribution, and define the functions (p a : m> ]R 

for every a e Z^ to be 

R 

<t>a(x) ■= ( Pa, fa) 

i=1 

for any x E Z^. We take these functions for defining our Fourier basis. Note that this coincides with 
the boolean case, where for b e {0,1} we have (po(b) = 1, and (pi(b) = (-l/' (or the identity function 
in the {-1,1} domain). For a more elaborate discussion on the Fourier expansion in generalized 
domains, we refer the interested reader to Chapter 8 in [41]. 

Flaving fixed the functions cpo, (pi ,..., (pq-i, every function / : Z^ i-» {0,1} can be uniquely 
expressed as 

fto) = Yj Mato) 

aeZjf 

Equipped with this, we can relate the influence of a variable i E [R] with respect to a function 
/ : Z^ hh> {0,1}, to the Fourier coefficients of / as follows: 

H(/> = E a 


In our analysis we will however be interested in degree-d influences, denoted Inf^(d) and defined 
as 

M?(/> = Y, fl 

a:a;^0,|a|<d 


where |a| in this context is the support of a, i.e., the number of indices j E [R] such that otj 4- 0. 

Observation 6.4 (see, e.g.. Proposition 3.8 in [40]). For a function / : Z^ i—> {0,1], the sum of all 
degree-d influences is at most d. 

We will also need a generalization of the notion of sub-cubes defined in Figure 1 in order to state 
the "It Ain't Over Till It's Over" Theorem [40], a main ingredient of the analysis of the reduction. 
In fact we only state and use a special case of it, as it appears in [52], 

Definition 6.5. Fix e > 0. For x E Z^, and S E C [R] such that |S £ | = eR, the sub-cube C Xr s E is defined 
as follows: 


Cx,s e ■= [ze zj : Zj = Xj V; g S £ } 

Theorem 6.6 (Special case of the It Ain't Over Till It's Over Theorem). For every e, 5 > 0 and integer q, 
there exist & > 0 and integers t, d such that any collection of functions fi,...,ft : Z^ i-» [ 0 , 1 } that satisfies 

V; : E [fj\ > 5 and Vi E [R], VI < h 4 h < t : min {inff )/ Inff (ft 2 )} < 9, 

has the property 


P 

x,S e 


A (fj(Cx A ) = 0) 

i =i 


<5. 


19 




Essentially what this theorem says is that if a collection of t fairly balanced functions are all 
identical to zero on the same random sub-cube with non-negligible probability, then at least two 
of these functions must share a common influential coordinate. In fact all the functions that we 
use throughout this section satisfy a strong balance property, that we denote by folding. 8 


Folded Functions. We say that a function / : {0,1} is folded if every line of the form 

{x £ 2^ | x = a + Al, A £ 2^} contains a unique point where f(x) is zero, where 1 £ 2^ is the all-one 
vector and a £ 2 ^ is any point. 

Remark 6.7. For any folded function / : 2^ i-h> {0,1}, we have that E Y [f(x)\ = 1-1 /q. 

We shall also extend the notion of dictatorship functions restricted to the folded setting. In this 
setting, the Gth coordinate dictator function fg : 2^ i-h> {0,1} for some ( £ [R] is defined as 


m = 


1 if xg + 0 
0 if X( = 0 . 


Notice that fg is folded because it is zero exactly on the coordinate hyperplane {x e Z R 


x { = 0 }. 


Truth Table Model. In order to guarantee the folding property of a function / : 2^ i-h> {0,1} in 
the truth table model, we adopt the following convention: 

- The truth table T f has q R ~ entries in Z l? , one for each x £ Z R such that x\ = 0. 

- For each x £ Z R with X\ = 0, the corresponding entry Y f(x) contains the unique A £ 2^ such 
that f(x + Al) = 0. 

We can however use Yy to query f(x) for any x £ Z R as follows: we have f(x) = 0 whenever 
Y f(x - xil) = Tf(0,X2 - x\,. . .,Xr - X\) = x\ and f(x) = 1 otherwise. 

We can now readily extend the notion of the long code encoding to match our definition of 
dictatorship functions. 

Definition 6.8. The long code encoding of an index £ £ [R] is simply Y j f , the truth table of the 

R-l 

folded dictatorship function of the f-th coordinate. Similarly, the long code Yy £ 2^J is indexed 
by all x £ 2^ such that X\ = 0. 


6.1.2 Reduction from unique games to Not-Equal-CSP 

We first describe the reduction from unique games to Not-Equal-CSP that is similar in many 
aspects to the reduction in Section 3.2. We then show that it also preserves the Sherali-Adams 
integrality gap. 


Reduction. Let 1A = (G, [R], El) be a unique games instance over a regular bipartite graph G = 
(V) W, E). Given Yf, we construct an instance I of Not-Equal-CSP. The reduction has three 
parameters: an integer q > 2 and reals 5, £ > 0, where £ is chosen such that i R is an integer. We 
then select t to be a large integer depending on e, 5 and q so as to satisfy Lemma 6.9. 

8 We abuse the notion of folding here, and we stress that this should not be confused with the usual notion of folding 
in the literature, although it coincides with standard folding for the boolean case. 
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The resulting Not-Equal-CSP instance I will be defined over \W\q R ^ ] variables and c\V\ con¬ 
straints, where c c(R, e, t, A, q) is a function of the degree A of the unique games instance, and the 
constants R, f, q and e. For our purposes, the unique games integrality gap instance that we start 
from, has constant degree A, and hence c is a constant. 

We refer to the variables of I as follows: it has a variable ( w,z ) 6 Z^ for each w e W and 
z £ Z^ such that zi = 0. For further reference, we let Var(X) denote the set of variables of I. The 
constraints of I are picked according the distribution in Figure 2 on page 21. One can see that 
a constraint C := C(v, , W v ,x,S E ) is then defined by the random vertex v (Fine 1), the t random 
neighbors *W V = {zv i,..., w t } (Fine 2), the random x £ Z^ (Fine 3) and the random subset S e C [R] 
(Fine 4). 


1. Pick a vertex v £ V uniformly at random. 

2. Pick t vertices zv\,... ,W\ randomly and independently from the neighborhood N(v) = 
{w £ W : vw £ £}. 

3. Pick x £ Zf at random. 

4. Fet m = eR. Pick indices i\ ,..., i m randomly and independently from [R] and let 
S £ = {z'i,..., i m ] be the set of those indices. 

5. Output the constraint on the variables {(Wi,z- Zil) | i £ [t], n~^ w .(z) £ C X/ $ e } that is true if 

(w u z - zil) t Zi V 1 < i < t, Vz such that n~^ Wl (z) £ C Xi s e 

where tl(z) for z £ Z^ is defined as n(z) := (z n (i),z 7I ( 2 ), .. .,z„(r)). 


Figure 2: Distribution for the Not-Equal-CSP constraints 

Note that if we think of the variables {w, z) for a fixed w £ W as the truth table of some function 
f w : Z^ i—> { 0 , 1 }, then / is forced to satisfy the folding property. 

We claim that if the starting unique games instance 1A was a Sherali-Adams integrality gap 
instance, then I is also an integrality gap instance for the Not-Equal-CSP problem. Similar to 
Section 3.2, we prove this in two steps; we first show that if OPT (tl) is small, then so is OPT(X). 
Formally speaking, the following holds: 

Lemma 6.9. For every e, q > 0 and alphabet size q > 2 there exists an integer t so that OPT(J) < q if 
OPT (IF) < 5 where 6 > 0 is a constant that only depends on e, rj, q and t. 

Proof. Suppose towards contradiction that OPT(X) > q. As noted earlier, for a fixed w £ IV, we 
can think of the variables (w, z) £ Var(J) as the truth table of a folded function f w : Z^ ha {0,1}, 
where T f w (z) := (w,z). This is possible since the variables (w, z) e Var (I) are restricted to z £ Z^ 
with zo = 0. Given this alternative point of view, define for every vertex zv £ W, a set of candidate 
labels L[w] as follows: 


L[w\ = {i £ [R] : Inf *(f a ) > 3} 
Note that \L[w]\ < d/d by Observation 6.4. 
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For every vertex v eV, and every < W V = {w\,..., w t } c N(V), let 

C v ,-w v := {c v/WviXr s ■■xeZ*, Sc [R] such that |S| = £R} 

A standard counting argument then shows that if OPT (!) > t then at least q/2 fraction of the tuples 
{v,w\,... ,u>t) have at least 77 /2 fraction of the constraints inside C V fw v satisfied. We refer to such 
tuples as good. Adopting the language of folded functions instead of variables, the aforementioned 
statement can be casted as 


P 

xeZ*,S e c[R] 


j\ (fwi ( n v,Wi (Qc,S< )) — 1) 


i =1 


>n/2 


if the tuple (v, W\,..., Wt) is good 


where 


fin (C X/S J) = 1 <=> f(z) = 1 Vz such that n 1 (z) e C X/Se 

From Remark 6.7, we get that E [f w J = 1 - 1/q, and hence invoking Theorem 6.6 on the functions 
fw ■ ■ ■, fw t , where f Wi (x) := 1 - f Wi (n V/JVi (x)), yields that for every good tuple, there exists U t 
1 2 £ {1,2,... ,t] such that f Wf .^ and f W( share a common influential coordinate. Note that this is 
equivalent to saying that there exists j\ € L[wf t ], y '2 £ F[zcv 2 ] such that n v>W( (j\) = u V/W( (jz)- 

We now claim that if OPT(X) > r], then we can come up with a labeling A: VUWh [R] that 

satisfies at least of edges, which contradicts the fact that OPT(TF) < 6 for a small enough value 
of 6 > 0. Towards this end, consider the following randomized labeling procedure: 

1. For every w £ W, let A(w) be a random label from the set L[iv], or an arbitrary label if L[iv] = 0. 

2. For every v 6 V, pick a random neighbor w 6 N(v) and set A(v) = n V/W (A(zv)). 

We can readily calculate the fraction of edges in tl that are satisfied by A. This follows from 
putting the following observations together: 

1. If we pick a random tuple (v, W\,..., Wt), it is good with probability 77 /2. 

2. If (v, W\,..., Wt) is good, and we pick w', w" at random from [iV ],..., zv t }, then with probability 
1 /f 2 the functions f w > and f W " share a common influential coordinates. 

3. If (v, W\,..., Wt) is good, and the functions f w < and f w » share a common influential coordinates, 
then picking a random label to w' and w" from L[w'] and L\zu”] respectively, will satisfies 

n v ,w'{A(w) = nv,w"(A(w") with probability l/(d 2 /& 2 ). 

Hence the expected number of edges satisfied by A in this case is 

P [A(y) = n v , w {A{w))} = 
vzveE 2fl z r z 


□ 

We now show that given an r-rounds Sherali-Adams solution of high value for tl, we can also 
come up with an r-rounds Sherali-Adams solution for I of high value as well. The proof goes 
along the same lines of that of Lemma 3.5, and hence we will try to only highlight the differences. 
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Lemma 6.10. Let {p(S) | S c V U W, |S| < r} foe a consistent collection of local distributions defining a 
solution to the r-rounds Sherali-Adams relaxation of the regular bipartite unique games instance 'Ll. Then 
we can define a consistent collection of local distributions {<j(S) | S c Var(I), |S| < r] defining a solution to 
the r-rounds Sherali-Adams relaxation of the Not-Equal-CSP instance I so that 


where t and e are the parameters of the reduction, and o(Sq ) is the distribution over the set of variables in 
the support Sc of constraint C. 

Proof. Let {p(S) | S c V U W, |S| < r) be a solution to the r-rounds SA relaxation of the unique games 
instance 'Ll, and recall that I is the Not-Equal-CSP instance we get by applying the reduction. 
We will now use the collection of consistent local distributions of the unique games instance, to 
construct another collection of consistent local distributions for the variables in Var (I). 

For every set S c Var(J) such that |S| < r, let T$ c W be the subset of vertices in the unique 
games instance defined as follows: 


E 

CeC 


P [a satisfies Cl 
a~o{S c ) 


>(1 —e)(l--) 1 —f- E 

Cj \ vweE 


P [A(u) + n WiV (A(w))\ 

( A(v), A(w) ~ () v,iv)) 


T s = {weW:(w,x)eS}. (6.1) 

We will now construct a(S) from p(Tg) in the following manner. Given a labeling Aj s for the 
vertices in T$ drawn from p(Ts), define an assignment ns for the variables in S as follows: for a 
variable (w, x) e S, let £ = Aj s (w) be the label of w according to Aj s . Then the new assignment 
as sets as((w,x)) := Y f t (x), where Y f { is the long code encoding of £ as in Definition 6.8. The 
aforementioned procedure defines a family {a(S)}scVar(/):|spr of local distributions for the variables 
of the Not-Equal-CSP instance I. The same argument as in the proof of Lemma 3.5 yields that 
(a(S) | S C Var(X), |S| < r] defines a feasible solution for the r-round Sherali-Adams relaxation of 
the Not-Equal-CSP instance I. 

It remains to bound the value of this feasible solution, i.e.. 


E P [a satisfies C] 

CeC (x~o(Sq) 


E 

,Wt 


P [ii>(A) satisfies C(v, AV v , x, S)1 

A~ f/() ...,U’t)),x,S 


( 6 . 2 ) 


where ip(.) the operator mapping a labeling of the vertices in T$ to an assignment for the variables 
in S, i.e., Aj s ) = as- The following claim, which is in some sense the equivalent of Claim 3.6 in 
the Not-Equal-CSP language, along with the same remaining steps of the proof of Lemma 3.5 will 
yield the proof. 

Claim 6.11. If A satisfies vw \,..., vw t simultaneously, then i/>(A) satisfies C(v, AV v , x, S) with proba¬ 
bility at least (1 - l)( 1 - -y). Moreover, if we additionally have that A(v) i S and Xjp v ) i 1 0, then if>( A) 
always satisfies C(v, AV v , x, S ). 

Equipped with this, we can use conditioning to lower-bound the probability inside the expec¬ 
tation in (6.2) by a product of two probabilities, where the first is 


P [i/>(A) satisfies C(v, 'Wv, x, S)| A satisfies vw \,..., vw t ] (6.3) 

A~ fi (( v,W\..,Wt)),x,S 


and the second is 


P [A satisfies vw\,, vw t ]. 
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Thus using Claim 6.11, we get 


E 

P [a satisfies C] 

>(1- £)(1 - -) ■ E 

P [A satisfies vw\,, vw t ] 

CeC 

a~o{S c ) 




>(1 - £)(1-) • (l - t ■ E P [A does not satisfy vw\ 

q ( v,zv[a ~n(lv,w)) 


The proof of Corollary 3.7 adjusted to the Not-Equal-CSP problem now yields Theorem 6.2. 

6.2 LP-reduction from Not-Equal-CSP to (/-Uniform-Vertex-Cover 

We will now reduce Not-Equal-CSP to (/-Uniform-Vertex-Cover on (/-Uniform hypergraphs with 
the reduction mechanism outlined in Section 4.1, which will yield the desired LP hardness for the 
latter problem. 

We start by recasting (/-Uniform-Vertex-Cover and Not-Equal-CSP in the language of Sec¬ 
tion 4.1. The first problem is defined on a fixed (/-uniform hypergraph H = (V, E). 

Problem 6.12 ((/-Uniform-Vertex-Cover(G)). The set of feasible solutions S consists of all possible 
vertex covers U c V, and there is one instance I = I(H') e 3 for each induced subgraph H' of G. 
For each vertex cover U we have Cost] ( H rfU) := |!i n V(H')\ being the size of the induced vertex 
cover in H'. 

We also recast Not-Equal-CSP as follows. Let n, q, UN be fixed, with k < n. 

Problem 6.13 (Not-Equal-CSP(h, q, k)). The set of feasible solutions S consists of all possible 
variable assignments, i.e., all possible values of and there is one instance I = I(P) for each 
possible set P - {Pi,..., P m } of Not-Equal-CSP predicates of arity k. As before, for an instance 
I e 3 and an assignment x <E Z”, Valj(x) is the fraction of predicates P, that x satisfies (see 
Definition 6.1). 

With the notion of LP relaxations and Not-Equal-CSP from above, we can now formulate 
LP-hardness of approximation for Not-Equal-CSPs, which follows directly from Theorem 6.2 by 
the result of [13] (See the discussion in [13] and Section 7 in [37]). 

Theorem 6.14. For every e > 0 mid alphabet size q > 2, there exists a constant arity k = k(e) such that for 
infinitely many n we have fc+(N ot-Equal-CSP(;j, q,k), 1 - 1/q - e,e) > h q ( 1o s "/ lo g lo g n ). 

Similar to Section 4.2, we first define our host hypergraph, and then provide a reduction that 
will yield our hardness result for (/-Uniform-Vertex-Cover using Theorem 4.4. 

Definition 6.15 ((/-Uniform-Vertex-Cover host hypergraph). For fixed number of variables n, 
alphabet q, and arity k < n we define a hypergraph H* = H*(n, q, k) as follows. Let X \,..., x„ denote 
the variables of the CSP. 

Vertices: For every subset S = [z'i, ..., 4} C [n], and every value of A = {a\,... ,a k ) e Zjj, we have 
a vertex v$,a corresponding to the Not-Equal-CSP predicate 


k 

P(x hr , ..., x ik ) = 1 if and only if f\(x i] ± Uj) 

7=1 
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Hyperedges: Any q vertices vs 1 ,a 1 ,■■■, v s v A q are connected with a hyperedge if there exists a 
variable x, 6 Hy =1 Sy, such that t fl, 2 t t a lq/ where a Vj is the entry of the vector Aj that is 
compared versus the variable X; in the predicate defined by the pair ( Sj,Aj ). In other words, we 
have a hyperedge connecting q vertices sharing a common variable Xj, if no two of their corresponding 
predicates check x, versus the same a E Z q . 

Note that the graph has q k Q) vertices, which is polynomial in n for fixed k and q. In order to 
establish LP-inapproximability of (/-Uniform-Vertex-Cover it now suffices to define a reduction 
satisfying Theorem 4.4. 

Main Theorem 6.16. For every e > 0,q > 2 and for infinitely many n, there exists a hypergraph H with 
\V(H)\ = n such that fc+ (^-Uniform-Vertex-Cover(H), (7 - £)> n Q ( logfI / loglog "). 

Proof. We reduce Not-Equal-CSP on n variables of alphabet Z q with sufficiently large arity k = k(e) 
to ^-Uniform-Vertex-Cover over H := H*(n,q,k). For a Not-Equal-CSP instance I\ = I\{P) and 
set of Not-Equal predicates P = {Ps 1 ,a 1 ,Ps 2 A 2 > ■ • •, Ps„„A„,}, let H(P) be the induced subgraph of G 
on the set of vertices V(P) = {vs u a, | 1 < i < m}. 

Similarly to Section 4.2, we provide maps defining a reduction from Not-Equal-CSP to q- 
Uniform-Vertex-Cover. The proof will then follow by combining Theorems 6.14 and 4.4. 

In the following, let Tli = (S\, 3i) be the Not-Equal-CSP problem and let YI 2 = {S 2 , 3 2 ) be the 
^-Uniform-Vertex-Cover problem. In view of Definition 4.3, we map I\ = I\{P) to 1 2 = I 2 (H(P)) 
and let p := 1 and Cj, := pj where m is the number of constraints in P. 

For a total assignment x € *Si we define U = U (x) := [vs,a '■ Ps,a( x ) = 0}- The latter is indeed 
a vertex cover. To see this, consider its complement I = I(x) := {v$ r A I Ps,a( x ) = !}■ Since x 
satisfies all the constraints corresponding to vertices in I simultaneously, no hyperedge can be 
completely contained in I. Otherwise this would imply that there exists a variable x,-, and q 
predicates P 1 , P' 2 ,... ,P' q e P requiring x, + j for all j E Z q , and yet are all simultaneously satisfied 
by x. 

We first verify the condition that Valj^x) = 1 - ^Costj 2 (U(x)) for all instances I\ E 3i and 
assignments x e *Si. Every predicate Ps,a in P over the variables in {x,- | i E S} has exactly one 
representative vertex Vs : a, that will be inside U only if Ps,a (x) = 0 , and hence our claim holds. In 
other words, for any specific P the affine shift is 1, and the normalization factor is T. 

Next we verify exactness of the reduction, i.e., 


OPT(id) = 1 - — OPT(J 2 ). 
m 

For this take an arbitrary vertex cover U E S 2 of H and consider its complement. This is an 
independent set, say I. As I is an independent set 9 , we know that for any variable X( in Uu S4 e/ S, 
there exist a least one a X( E Z q such that Xf is not checked versus a X{ in any of the predicates 
corresponding to vertices in I. Elence any assignment x setting each Xf to d X[ as defined earlier, sets 
Ps,a( x ) = 1 f°r all vs,a € I. Then the corresponding vertex cover U (x) is contained in U. Thus there 
always exists an optimum solution to 1 2 that is of the form U(x). Therefore, the reduction is exact. 
It remains to compute the inapproximability factor via Theorem 4.4. We have 


P 2 


1 — £ 

1 -(1-1/q-e) 


>q- ©(£) 


□ 

9 In a hypergraph H = (V, E) a set I c V is said to be independent if no hyperedge of H is fully contained in I. 
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7 SDP-Hardness for independent set 


We saw in Section 4.2 how to obtain an LP-hardness for vertex cover and independent set, starting 
from an LP-hardness for the 1F-CSP problem. Restricting our starting CSP to have only one free 
bit is crucial for the vertex cover problem, since each constraint is then represented by a cloud 
containing exactly two vertices in the resulting graph. In this case, an assignment satisfying almost 
all the constraints, corresponds to a vertex cover containing slightly more than half of the vertices 
(i.e., one vertex in almost all the clouds, and both vertices in the unsatisfied clouds), whereas if 
no assignment can simultaneously satisfy more than ^-fraction of the constraints, then any vertex 
cover should contain almost all the vertices. This extreme behaviour of the resulting graph is 
necessary to obtain a gap of 2 for the vertex cover problem. 

However, if we are only interested in the independent set problem, any CSP with a sufficiently 
large gap between the soundness and completeness can yield the desired LP-Hardness, by virtue 
of the well-known FGLSS reduction [20]. Formally speaking, given reals 0 < s < c < 1, and any 
CSP problem n(P, n, k), where n is the number of variables and P is a predicate of arity k, and 
knowing that no small linear program can provide a (c, s)-approximation for this CSP, then one 
can show that no small LP can as well approximate the independent set problem within a factor 
of c/s. This can be simply done by tweaking the reduction of Section 4.2 in a way that the number 
of vertices in each cloud is equal to the number of satisfying assignments for the predicate. Hence 
dropping the one free bit requirement, and restricting ourselves to CSPs such that c/s = 1/e for 
arbitrarily small e := e(k) > 0, would yield the desired co( 1) LP-hardness for the independent set 
problem. 

Moreover, the reduction framework of [10] and our construction in Section 4.2 are agnostic to 
whether we are proving LP or SDP lower bounds, and hence having an analog of Theorem 4.8 in 
the SDP world would yield that any SDP of size less than n Q ( lo g”/loglogn) p as an integrality gap 
of a>(l) for the independent set problem. In fact such SDP-hardness results for certain families 
of CSPs and hence an analog of Theorem 4.8 are known: if our starting CSP has a predicate that 
supports pairwise independence with a sufficiently large arity k, then the result of [5] by virtue of 
[37] gives us the desired SDP base hardness. By the argumentation from above we obtain: 

Corollary 7.1. For every e > 0 and for infinitely many n, there exists a graph G with |V(G)| = n, such that 
no polynomial size SDP is a (1 / e)-approximate SDP relaxation for independent set(G). 

8 Discussion of related problems 

We believe that our approach extends to many other related problems. As proved here, it applies 
to ^-Uniform-Vertex-Cover. Moreover, we would like to stress that our reduction is agnostic to 
whether it is used for LPs or SDPs and Lasserre gap instances for 1F-CSP, together with [38] and 
our reduction would provide SDP hardness of approximation for vertex cover. This already holds 
for the independent set problem as we saw in Section 7, since the starting CSP does not need to 
have only one free bit, as long as the gap between the soundness and completeness is sufficiently 
large. 

Note that we are only able to establish hardness of approximations for the stable set problem 
within any constant factor, while assuming P F NP one can establish hardness of approximation 
within fi 1-£ . The reason for this gap is that the standard amplification techniques via graph 
products do not fall into the reduction framework in [10]. Also, there will be limits to amplification 
as established by the upper bounds in Section 5. 
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Finally, we would like to remark that our lower bounds on the size can be probably further 
strengthened, however, with our current reductions this would require a strengthened version of 
the results in [13]. 
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A Definition of Sherali-Adams for General Binary Linear Programs 


For completeness, we give the general definition of the r-rounds SA tightening of a given LP, and 
then we show that for CSPs the obtained relaxation is equivalent to (2.1). 

Consider the following Binary Linear Program for c € IR", A e IR'" X " and b e R mxl : 


n 

max ^ CjX t 

i=i 

s.t. Ax < b 

xe {0,1}”. 

By replacing the integrality constraint with 0 < x < 1, we get an LP relaxation. 

Sherali and Adams [50] proposed a systematic way for tightening such relaxations, by refor¬ 
mulating them in a higher dimensional space. Formally speaking, the r-rounds SA relaxation is 
obtained by multiplying each base inequality E/=i A ijXj < bj by ELeS x s IlfeT^ - x t) for all disjoint 
S,T c [n] such that |S U T| < r. This gives the following set of polynomial inequalities for each such 
pair S and T: 


y AijXj 

JAn] 


n Xs n* - ^n xs 

j seS £gT seS t€T 


Vi € [ m \, 


0 < xj ]^[ ]^[(1 - x t ) < 1 V; e [n]. 

seS teT 

These constraints are then linearized by first expanding (using x ? = Xj, and thus x,(l - x,) = 0), and 
then replacing each monomial IlfeH x i by a new variable i/h, where H c [n] is a set of size at most 
r. Naturally, we set r/g := 1. This gives us the following linear program, referred to as the r-rounds 
SA relaxation: 


max 


s.t. 


n 


Yj c m 

i= 1 


L<-d ihi E A-yT/HUSup 


Her 


\jAn] 


< 


bi'Yi-^y 


HUS 


HcT 


0 < ^(-l) |H| 3/HuSuj;) < 1 

HCT 

1/0 = 1 


\/ie[m],S,T, 

V;e[n],VS,T, 


where in the first two constraint we take S,T c [n] with SnT = 0 and |S U T\ < r. 

One could go back to the original space by letting x, = i/|,| and projecting onto the x, however 
we will refrain from doing that, in order to be able to write objective functions that are not linear 
but d egree-/c polynomials, as is natural in the context of CSPs of arity k. Since we need to do k 
rounds of SA before even being able to write the objective function as a linear function, it makes 
more sense to work in higher dimensional space. 

For constraint satisfaction problems, the canonical r-rounds SA relaxation is defined as 
follows. Consider any CSP defined over n variables X\, ...,x n € [R], with m constraints C = 
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{Ci,...,C m } where the arity of each constraint is at most k. For each j £ [/;] and u £ [R], we 
introduce a binary variable x(j, u), meant to be the indicator of Xj = u. Using these variables, the 
set of feasible assignments can naturally be formulated as 

Y x (b u ) = 1 V; £ [n ], 

I/6[R] 

x(j,u)e{ 0,1} V; £ [u], u £ [R]. 

If we relax the integrality constraints by, for each j £ [«], u £ [R], replacing x(j, u) £ {0,1} by 
x(i, u) > 0 (we omit the upper bounds of the form x(j, u) < 1 as they are already implied by the 
other constraints) then we obtain the following constraints for the r-rounds SA relaxation : 

E<-« IHI E J/HUSUfO»} - E (- 1 ) |h| i/hus V; € [n],S,T, 

HCT »e[R] Her 


X(-l) |H| yHu S u, (;/!( ), ^0 V(;,u) £ [n] x [R],S,T, 

HCT 

where we take S, T c [n] x [R] with S n T = 0 and |S U T| < r. 

To simplify the above description, we observe that we only need the constraints for which 
T = 0. 

Claim A.l. All the above constraints are implied by the subset of constraints for which T = 0. 

Proof. The equality constraints are easy to verify since L»e[R] J/su{(;,u)l = }/s for all S c [n] X [R] with 
|S| < r implies 

E (-1)' 1 ™' £ yaw, = £ (-i) lsnT| y„. 

SCHcSuT we[R] ScHcSuT 

Now consider the inequalities. If we let T = {(/i. Mi), (j 2 , uf), ■ ■ ., (jt, U{)} then by the above 
equalities 

2^( _1 ) |H| J/HUSU{(;,«)) = ^ (-l) |H| 3/HuSut(;,u)} ~ ^ (-l) |H| 1/HuSU((;,u),(;i,« 1 )} 

HCT HcTMOTuP) HCT\{(j 1/Ul )} 

= Y Y (- 1 ) |H| yHuSU{(;,«),(;i,up) 

u;e[R]:u^U! HCT\((; 1/Ml )l 


E - E 

u' t e[R]:u' t ?u t 


ySU{(y,M),(;i,Uj).(/,,«;)} - 


Hence, we have also that all the inequalities hold if they hold for those with T = 0 and S such that 
|S| < r. □ 

By the above claim, the constraints of the canonical r-rounds SA relaxation of the CSP can be 
simplified to: 

Y ysu{( 7 ,«)) = ys V; € [n],S c [n] x [R] : |S| < r, 

ue[R] 
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V(;,u) € [h] x [R], S c [n\ x [R] : |S| < r. 


ysD{(j,u)} > o 

To see that this is equivalent to (2.1) observe first that y$ = 0 if { (j, u'), ( j , it")} C S. Indeed, by the 
partition constraint, we have 

T. yih“'W;>")luKi,«)) = x J{(j,u'),{j,u")) > 

ueR 

which implies the constraint 2 y{(j,n'),{j,u")\ ^ )• This in turn (together with the non¬ 

negativity) implies that y\(j, U '),(b u ")\ = 0- Therefore, by again using the partition constraint, we have 
ys = 0 whenever {(/, u'), (j, u”)\ c S and hence we can discard variables of this type. We now obtain 
the formulation (2.1) by using variables of type R({j 1 ,...q t },(xi lr ...,u t )) instead of y{(j lMr ),(j 2M2 ),...,(j t ,u l )V The 
objective function can be linearized, provided that the number of rounds is at least the arity of the 
CSP, that is r ^ k, so that variables for sets of cardinality k are available. 

B Proof of Claim 3.6 

Proof of Claim 3.6. Assume that A satisfies VW\,..., vw t simultaneously, i.e., 

tt i , /K , 1 (A(wi)) = ■ ■ ■ = n VrWt (A(w t )) = A(v) (B.l) 

and let C X/ s and C X/ s be the sub-cubes as in Figure 1. According to the new assignment, every 
variable (wi, z) in the support of C(v,'W v , x, S) takes the value z^ Wi y Assume w.l.o.g. that (w u z) is 
such that Tzfl,.(z) € C X/ s, and let y € C X/ s satisfies n V/Wj (y) = z. Then we get 

z A(7Vi) = n v,Wi(y)A(wi) = Vn v , Wi (A(wi)) ~ 3/A(d) (B-2) 

where the last equality follows from (B.l). We know from the construction of the sub-cube C X/ g 
that for all j £ S and for all y 6 C X/ $, we have y, = Xj. It then follows that if A(v) i S, equation B.2 
yields that 

ZA ( Wi ) = y A(v) = X A (v) V (W if z) S.t. TZ~} W .(Z) € C X , S 

Similarly, for the variables (zty,z) with n~^ w (z) € C X/ s, we get that 

z A(wi) ~ y A (v) — ^A(n) V ( Wj,Z) S.t. n VrW .(z) 6 C X/ s 

Thus far we proved that if If A satisfies vw\,..., vwt simultaneously and A(v) £ S, then tp(A) satisfies 
C(v, f W V/ x, S). But we know by construction that |S| = eR, and hence A(v) t S with probability at 
least 1 - e. □ 

C Proof of Claim 6.11 

Proof of Claim 6.11. Assume that A satisfies vw\,..., vw t simultaneously, i.e., 

n VrWl (A{w{)) = • • • = n V/Wl (A(w t )) = A(v) (C.l) 

and let C r< s, be the sub-cube as in Figure 2. For z € [q] R with 7i“^.(z) € C X/ s e , let y € [q] R be such 
that n VrWi {y) = z. Recall that a constraint C(v, AV v , x, S e ) looks as follows: 

(wi, z © z.) t z o V 1 < i < t, Vz such that n~j v fz) e C X/ $ e (C.2) 
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We now adopt the functions point of view, i.e., for a w € W, the variables (w, z) for z 6 [q] R with zo 
are the entries of the truth table of a function f w , and according to the new assignment A, f w is the 
folded dictatorship function of the label of A (w). 

So if we let / := f Wi for some 1 < i < t, and z := <oy,z), we get that 

(Wi, z ® z) t zo <=> f(z) + 0 

and by our definition of the dictatorship function, the latter is zero iff z^ Wi ) = 0. But 

Z A (Wi) = n v,Wi(y)A(wi) = y n V:W .(A(wi)) = y A{v) (c.3) 

where the last equality follows from (C.l). We know from the construction of the sub-cube C Xr s £ 
that for all j £ S E and for all y £ C X/ s £ , we have yj = Xj. It then follows that if A(v) S e , equation 
C.3 yields that 

ZA(wi) ~ yA(v) — %A(v) V (jVi/Zy S.t. Tl v w .(z) £ C X/ S £ 

Moreover, given that x is chosen uniformly at random from \(]] R , we get that for any i £ [R], 
!*«[<,]* = °] = \■ 

Thus far we proved that if If A satisfies vw \,..., vw t simultaneously and A(u) S, then ip( A) 
satisfies C(v, TfA, x, S) with probability 1 - -y. But we know by construction that |S| = i R, and hence 
A(v) <£ S with probability at least 1 - e. □ 
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